NSF PAR Search | NSF Public Access Repository

Ensuring scientific reproducibility in bio-macromolecular modeling via extensive, automated benchmarks

https://doi.org/10.1038/s41467-021-27222-7

Koehler Leman, Julia; Lyskov, Sergey; Lewis, Steven M.; Adolf-Bryfogle, Jared; Alford, Rebecca F.; Barlow, Kyle; Ben-Aharon, Ziv; Farrell, Daniel; Fell, Jason; Hansen, William A.; et al. (December 2021, Nature Communications)

Abstract Each year vast international resources are wasted on irreproducible research. The scientific community has been slow to adopt standard software engineering practices, despite the increases in high-dimensional data, complexities of workflows, and computational environments. Here we show how scientific software applications can be created in a reproducible manner when simple design goals for reproducibility are met. We describe the implementation of a test server framework and 40 scientific benchmarks, covering numerous applications in Rosetta bio-macromolecular modeling. High performance computing cluster integration allows these benchmarks to run continuously and automatically. Detailed protocol captures are useful for developers and users of Rosetta and other macromolecular modeling tools. The framework and design concepts presented here are valuable for developers and users of any type of scientific software and for the scientific community to create reproducible methods. Specific examples highlight the utility of this framework, and the comprehensive documentation illustrates the ease of adding new tests in a matter of hours.
Modeling beta‐sheet peptide‐protein interactions: Rosetta FlexPepDock in CAPRI rounds 38‐45

https://doi.org/10.1002/prot.25871

Khramushin, Alisa; Marcu, Orly; Alam, Nawsad; Shimony, Orly; Padhorny, Dzmitry; Brini, Emiliano; Dill, Ken A.; Vajda, Sandor; Kozakov, Dima; Schueler‐Furman, Ora (January 2020, Proteins: Structure, Function, and Bioinformatics)

Abstract Peptide‐protein docking is challenging due to the considerable conformational freedom of the peptide. CAPRI rounds 38‐45 included two peptide‐protein interactions, both characterized by a peptide forming an additional beta strand of a beta sheet in the receptor. Using the Rosetta FlexPepDock peptide docking protocol we generated top‐performing, high‐accuracy models for targets 134 and 135, involving an interaction between a peptide derived from L‐MAG with DLC8. In addition, we were able to generate the only medium‐accuracy models for a particularly challenging target, T121. In contrast to the classical peptide‐mediated interaction, in which receptor side chains contact both peptide backbone and side chains, beta‐sheet complementation involves a major contribution to binding by hydrogen bonds between main chain atoms. To establish how binding affinity and specificity are established in this special class of peptide‐protein interactions, we extracted PeptiDBeta, a benchmark of solved structures of different protein domains that are bound by peptides via beta‐sheet complementation, and tested our protocol for global peptide‐docking PIPER‐FlexPepDock on this dataset. We find that the beta‐strand part of the peptide is sufficient to generate approximate and even high resolution models of many interactions, but inclusion of adjacent motif residues often provides additional information necessary to achieve high resolution model quality.
ClusPro in rounds 38 to 45 of CAPRI: Toward combining template‐based methods with free docking

https://doi.org/10.1002/prot.25887

Padhorny, Dzmitry; Porter, Kathryn A.; Ignatov, Mikhail; Alekseenko, Andrey; Beglov, Dmitri; Kotelnikov, Sergei; Ashizawa, Ryota; Desta, Israel; Alam, Nawsad; Sun, Zhuyezi; et al. (March 2020, Proteins: Structure, Function, and Bioinformatics)

Abstract Targets in the protein docking experiment CAPRI (Critical Assessment of Predicted Interactions) generally present new challenges and contribute to new developments in methodology. In rounds 38 to 45 of CAPRI, most targets could be effectively predicted using template‐based methods. However, the server ClusPro required structures rather than sequences as input, and hence we had to generate and dock homology models. The available templates also provided distance restraints that were directly used as input to the server. We show here that such an approach has some advantages. Free docking with template‐based restraints using ClusPro reproduced some interfaces suggested by weak or ambiguous templates while not reproducing others, resulting in correct server predicted models. More recently we developed the fully automated ClusPro TBM server that performs template‐based modeling and thus can use sequences rather than structures of component proteins as input. The performance of the server, freely available for noncommercial use at https://tbm.cluspro.org, is demonstrated by predicting the protein‐protein targets of rounds 38 to 45 of CAPRI.